Basic CUDA support #68

WardBrian · 2023-10-06T15:39:23Z

Based off of #64.

Locally, the added type1 cuda tests are passing.
CI will probably take a couple iterations before it is all working.

There's not really much to this: I extended the get_nufft_func helper and switched to handling the necessary fftshifts manually, since the cuda functions don't (currently?) accept modeord.

Needs more checks, modeord support?
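For readers unfamiliar with the modeord workaround, here is a minimal sketch of what handling the shifts manually can look like. It assumes FINUFFT's documented convention (modeord=0 orders modes from most negative to most positive, modeord=1 uses FFT-style ordering). The helper names are hypothetical and are not the code in this PR.

```python
import torch


def centered_to_fft_order(modes: torch.Tensor, dims) -> torch.Tensor:
    """Reorder modes from centered order (-N/2 .. N/2-1) to FFT-style order.

    Hypothetical helper: mimics going from modeord=0 to modeord=1 by hand.
    """
    return torch.fft.ifftshift(modes, dim=dims)


def fft_to_centered_order(modes: torch.Tensor, dims) -> torch.Tensor:
    """Inverse of centered_to_fft_order (modeord=1 back to modeord=0)."""
    return torch.fft.fftshift(modes, dim=dims)


def centered_frequencies(n: int) -> torch.Tensor:
    """Integer frequencies in centered order, for even or odd n."""
    return torch.arange(-(n // 2), (n + 1) // 2)
```

In a wrapper, the backend would always be called in one ordering and the output shifted only when the caller asked for the other one. The shifts are pure index permutations, so they work the same way on cpu and cuda tensors.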

eickenberg

perfect, looks great to me!


eickenberg · 2023-10-06T16:56:30Z

pytorch_finufft/functional.py


        # CPU idiosyncracy that needs to be done differently
-        coord_ramps = torch.from_numpy(np.mgrid[slices])
+        coord_ramps = torch.from_numpy(np.mgrid[slices]).to(points.device)


This will work, but it allocates the array on the cpu and then copies it to the gpu. We may want to borrow from the prior code, which calls torch.meshgrid(x_vals, y_vals, z_vals) after allocating x_vals = torch.arange(start, end, device=device) etc., so that the grid is created directly on the gpu.

This is probably a second-order optimization, since there will likely be other bottlenecks to fix first, so keep as is for now.
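A rough sketch of the suggestion above, for when it becomes worth doing. The function name and the starts/ends arguments are hypothetical; the real code builds its ranges from slices, whose exact form is not shown in this diff.

```python
import torch


def coord_ramps_on_device(starts, ends, device):
    """Build an np.mgrid-style stack of coordinate ramps directly on `device`.

    Hypothetical helper: one arange per axis, combined with "ij" indexing so
    the result has shape (ndim, n_0, n_1, ...), like np.mgrid[slices].
    """
    axes = [
        torch.arange(start, end, device=device)
        for start, end in zip(starts, ends)
    ]
    return torch.stack(torch.meshgrid(*axes, indexing="ij"))
```

Calling coord_ramps_on_device(starts, ends, points.device) would then avoid the host allocation and the host-to-device copy.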

eickenberg · 2023-10-06T16:57:33Z

pytorch_finufft/functional.py


        grads_points = None
        grad_values = None

        ndim = points.shape[0]

-        nufft_func = get_nufft_func(ndim, 2)
+        nufft_func = get_nufft_func(ndim, 2, points.device.type)


Sending points.device object doesn't work?

BTW do we know anything about how well cufinufft interacts with multiple devices?

I'm guessing cufinufft does not like multiple devices, but I haven't tried.

We definitely need more checks that the arrays are both on the same device (at least cpu/cuda, if not also checking they're on the same index of cuda)
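One possible shape for such a check, as a sketch only. The function name is hypothetical, and whether the check belongs in the forward pass or in a shared validation helper is left open.

```python
import torch


def check_same_device(points: torch.Tensor, values: torch.Tensor) -> None:
    """Raise if the two tensors do not live on the same device.

    Comparing torch.device objects covers both the device type (cpu/cuda)
    and, for cuda tensors, the device index.
    """
    if points.device != values.device:
        raise ValueError(
            "points and values must be on the same device, got "
            f"{points.device} and {values.device}"
        )
```

Because the .device of a cuda tensor always carries an index, this rejects cuda:0 vs cuda:1 as well as cpu vs cuda.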

Oh, and we could use points.device, but the only thing we care about for now is whether it is cuda or cpu, so sending the type seemed simplest.
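To illustrate why the device type string is enough, here is a sketch of a dispatch keyed on it. This is not the actual get_nufft_func from the PR. It assumes the nufft{ndim}d{type} naming used by the simple interfaces of finufft and cufinufft, and the function names below are hypothetical.

```python
import importlib


def nufft_func_name(ndim: int, nufft_type: int, device_type: str):
    """Return (module name, function name) for a given dimension, type, device.

    Hypothetical helper: only "cpu" and "cuda" are recognised, which is all
    that points.device.type needs to distinguish for now.
    """
    if ndim not in (1, 2, 3):
        raise ValueError(f"unsupported number of dimensions: {ndim}")
    if nufft_type not in (1, 2):
        raise ValueError(f"unsupported NUFFT type: {nufft_type}")
    if device_type == "cpu":
        module_name = "finufft"
    elif device_type == "cuda":
        module_name = "cufinufft"
    else:
        raise ValueError(f"unsupported device type: {device_type}")
    return module_name, f"nufft{ndim}d{nufft_type}"


def resolve_nufft_func(ndim: int, nufft_type: int, device_type: str):
    """Import the backend lazily, so cufinufft is only needed for cuda input."""
    module_name, func_name = nufft_func_name(ndim, nufft_type, device_type)
    return getattr(importlib.import_module(module_name), func_name)
```

Passing the full torch.device would only start to matter once the device index is used, for example to pick which GPU the backend runs on.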


WardBrian added 7 commits October 6, 2023 10:31

MVP: cuda support for type 1

b50ddcd

Needs more checks, modeord support?

Add (failing) tests

7ca3c6f

First pass at CI

c8c9e0c

Work around modeord issue, cuda forward tests passing

a89b8ef

Skip cuda tests in GHA

6937124

MVP cuda backward

6bd21c2

Minor clean up

fd1f729

WardBrian requested a review from eickenberg October 6, 2023 15:39

WardBrian mentioned this pull request Oct 6, 2023

REF FINUFFT real dtype deprecated #69

Closed

WardBrian linked an issue Oct 6, 2023 that may be closed by this pull request

ENH cuda support #54

Closed

2 tasks

Factor out common test code

b66a421

Base automatically changed from mike-consolidate-dimensionalities-type-1 to main October 6, 2023 16:50

Formatting

052cf41

eickenberg approved these changes Oct 6, 2023


WardBrian added 8 commits October 6, 2023 14:10

Consolidate yum installs

8d8f748

CI: Jenkinsfile work

ed5c4e5

CI: Fix email

5492243

Lint fixes

41cab36

Try k40s

2a4f17d

Back to v100

80a0dcc

CI: Build finufft as well as cufinufft

03c79d6

CI: tweaks

f431920

WardBrian merged commit 35fd706 into main Oct 6, 2023
0 of 4 checks passed

WardBrian deleted the feat/cuda-start branch October 6, 2023 22:25
